Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data

نویسندگان

چکیده

Classification tree is a widely used machine learning method. It has multiple implementations as R packages; rpart, ctree, evtree, and C5.0. The details of these are not the same, hence their performances differ from one application to another. We interested in performance classification cells using single-cell RNA-Sequencing data. In this paper, we conducted benchmark study 22 Single-Cell RNA-sequencing data sets. Using cross-validation, compare packages’ prediction based on Precision, Recall, F1-score, Area Under Curve (AUC). also compared Complexity Run-time packages. Our shows that rpart evtree have best Precision; F1-score AUC; C5.0 prefers more complex trees; consistently much faster than others, although its complexity often higher others.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...

متن کامل

A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...

متن کامل

I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing

Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...

متن کامل

LEAP: constructing gene co-expression networks for single-cell RNA-sequencing data using pseudotime ordering

Summary To construct gene co-expression networks based on single-cell RNA-Sequencing data, we present an algorithm called LEAP, which utilizes the estimated pseudotime of the cells to find gene co-expression that involves time delay. Availability and Implementation R package LEAP available on CRAN. Contact [email protected]. Supplementary information Supplementary data are available at Bioinf...

متن کامل

A sparse differential clustering algorithm for tracing cell type changes via single-cell RNA-sequencing data

Cell types in cell populations change as the condition changes: some cell types die out, new cell types may emerge and surviving cell types evolve to adapt to the new condition. Using single-cell RNA-sequencing data that measure the gene expression of cells before and after the condition change, we propose an algorithm, SparseDC, which identifies cell types, traces their changes across conditio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Microbiology research

سال: 2021

ISSN: ['2036-7473', '2036-7481']

DOI: https://doi.org/10.3390/microbiolres12020022